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Abelian bordered factors and periodicity 


Emilie Charlier* Tero Harju| Svetlana Puzyninaland Luca Q. Zamboni^ 


Abstract 

A finite word u is said to be bordered if u has a proper prefix which is also a suffix of m, 
and unbordered otherwise. Ehrenfeucht and Silberger proved that an infinite word is purely 
periodic if and only if it contains only finitely many unbordered factors. We are interested 
in abelian and weak abelian analogues of this result; namely, we investigate the following 
question(s): Let w be an infinite word such that all sufficiently long factors are (weakly) 
abelian bordered; is w (weakly) abelian periodic? In the process we answer a question of 
Avgustinovich et al. concerning the abelian critical factorization theorem. 


1 Introduction 

A finite word u is bordered^ if there exists a non-empty word z which is a proper prefix and 
a suffix of u. The following theorem states a well-known connection between periodicity and 
unbordered factors: 

Theorem 1. [6l ITT] An infinite word w is purely periodic if and only if there exists a constant 
C such that every factor v of w with |u| > C is bordered. 

In this paper we are interested in extending this result to the abelian and weak abelian settings. 

Two finite words u and v are abelian equivalent if and only if for each letter a, the number of 
occurrences of a in u (denoted |u|a) is equal to |u|a. In other words, u and v are permutations of 
one another. An infinite word w is called abelian ultimately periodic if w = UV1V2V3 ■ ■ ■, where 
the Vi are pairwise abelian equivalent. 

An infinite word w is called weakly abelian ultimately periodic if w = UV1V2V3 • • •, where Vi are 
finite words with the same frequencies of letters for i > 1 . In other words, a word is weakly 
abelian ultimately periodic if it can be factored into some prefix u followed by infinitely many 
words of possibly different lengths with the same letter frequencies. For more on weak abelian 
periodicity see [2]. 

We say that a finite word u is abelian bordered if u contains a non-empty proper prefix which is 
abelian equivalent to a suffix of u. Abelian bordered words were recently considered in UM- A 
finite word u is weakly abelian bordered if it has a non-empty proper prefix and a suffix with the 
same letter frequencies. 

In Sections [3] and m we consider weak abelian borders in the binary and non-binary cases, respec¬ 
tively. We prove that the condition on finitely many weak abelian unbordered factors implies 
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weak abelian periodicity plus some additional restrictions (Theorems [2] and [3|) . In Section O we 
consider infinite words having only finitely many abelian unbordered factors. We do not know 
if such words are necessarily abelian periodic. However, we are able to show that such words 
have bounded abelian complexity. In Section [6l we provide an answer to a question from [T] 
concerning abelian critical factorization theorem. In the last section we summarize the results 
and propose some open problems. 


2 Preliminaries 

In this section we give some basics on words following terminology from [8] and introduce our 
notions. 

Given a finite non-empty set S (called the alphabet), we denote by S* and respectively, the 
set of finite words and the set of (right) infinite words over the alphabet S. Given a finite word 
u = uiU 2 • • - Un with n > 1 and Ui G S, we denote the length n of rt by |u|. The empty word 
will be denoted by e and we set |e| = 0. Given the words w, x, y, z such that w = xyz, x is 
called a prefix, y is a factor and z a suffix of w. The word x is a proper prefix if 0 < |x| < |r(;|. 
We let w[i,j] = WiWi+i ■ ■ ■ Wj denote the factor starting at position i and ending at position j of 
w = W 1 W 2 W 3 • • •, where the Wk G S. An infinite word w is ultimately periodic (or briefly periodic) 
if for some finite words u and v it holds w = uv‘^ = uvv • • •; rc is purely periodic ii u = e. An 
infinite word is aperiodic if it is not ultimately periodic. Given a finite word u = U 1 U 2 • • - Un 
with n > 1 and Ui G S, for each a G S, we let \u\a denote the number of occurrences of the 
letter a in u. Two words u and v in S* are abelian equivalent, denoted u ^ab v, if and only if 
\'U‘\a = I'^la for all a € T,. It is easy to see that abelian equivalence is an equivalence relation 
on S*. An infinite word w is called abelian ultimately periodic (or briefly abelian periodic) if 
w = UV 1 V 2 V 3 ■ ■ ■, where Vi ^ab Vj for all integers i,j > 1. 

For a finite non-empty word w G T,*, we define the frequency Pa{w) of a letter a G S in rc as 
Pa{w) = For an infinite word w G and a letter a G S, if the limit 

lim pa{w[l..n]) 

n^oo 

exists, then we define the frequency Pa{w) of a in u) to be the latter limit. The infinite word w 
has uniform letter frequencies if, for every letter a of w, the ratio has a limit Pa{w) 

when n —>■ 00 , uniformly in A;. In general the existence of uniform frequencies is stronger than 
(prefix) frequencies, but in our consideration in fact all the words have uniform frequencies. 

Now we provide some background on weak abelian periodicity from [2]. 

Definition 1. An infinite word w over an alphabet S is called weakly abelian ultimately periodic 
(or briefly weakly abelian periodie) if rc = UV 1 V 2 V 3 ■ ■ ■, where Pa{vi) = Pa{vj) for all a G S and 
all integers i,j > I. 

An infinite word w is called bounded weakly abelian periodic, if it is weakly abelian periodic with 
bounded lengths of blocks, i.e., there exists C such that for every i we have \vi\ < C. 

Example 1. The word (01)(0^1^)(0^1^)(0^1^) • • • is weakly abelian periodic but not bounded 
weakly abelian periodic. 

We make use of the following geometric interpretation of weak abelian periodicity. We translate 
an infinite word w = W 1 W 2 • • • G to a graph visiting points of the lattice by interpreting 
letters of w as drawing instructions. In the binary case we associate 0 with a move by the vector 
vq = (I,—I), and 1 with a move by vi = (1,1). We start at the origin (xo,yo) = (0,0). At 



step n, we are at a point {xn-i,yn-i) and we move by a vector corresponding to the letter Wn, 
so that we come to a point {xn,yn) = {xn-i,yn-i) + and the two points {xn-i,yn-i) and 
{xn,yn) are connected with a line segment. We let denote the corresponding graph, and 
Guii'IT'') — {Xn^yn}- 

Lemma 1. A weakly abelian periodic word w has a graph with infinitely many integer points on 
some line with a rational slope. 

We remark that instead of the vectors (1,-1) and (1,1), one could use any other pair of non- 
collinear vectors vq and vi. For a /c-letter alphabet one can consider a similar graph in with 
linearly independent integer vectors vi,..., v^. For example, one can take as Vj a unit vector 
with 1 at position i and all other coordinates equal to 0. 

In the binary case and for vectors vq = (1, —1) and vi = (1,1), it will be convenient for us to 
use a function g^'. yn, so that Guj{n) = {n,gw{n)). It can also be regarded as a piecewise 
linear function with line segments connecting integer points (see an example on Figure 1). 



Figure 1: The graph of the Thue-Morse word with vq = (1,-1), vi = (1,1). 


3 Weak abelian borders: binary alphabet 

We proceed with the following theorem relating weak abelian borders and weak abelian period¬ 
icity, giving one way analog of the characterization of periodicity in terms of unbordered factors 
(Theorem [1]) : 

Theorem 2. Let w he an infinite binary word. If there exists a constant C such that every 
factor V of w with |u| > G is weakly abelian bordered, then w is bounded weakly abelian periodic. 
Moreover, its graph lies between two rational lines and has points on each of these two lines with 
bounded gaps. 

Remark that following the terminology from [2], a bounded weakly abelian periodic word is 
necessarily of bounded width. Namely, a word w is of bounded width if there exist two lines 
with the same slope, so that the graph of w lies between these two lines. Formally, there exist 
numbers a, (!,(!' so that ax -\- fi < gw{x) < ax + ff. We note that in the case of weak abelian 
periodicity a must be rational. 

We will also need the notion of balance. A word w G YA’ is called K-balanced, if for each letter a 
and two factors u,voiw such that |m| = |u| the inequality Hrija — |u|a| < K holds. We simply say 
that w is balanced if it is iF-balanced for some K. We remark that balance implies the existence 
of letter frequencies: 

Proposition 1. Proposition 2.4] An infinite word w G is balanced if and only if it has 
uniform letter frequencies and there exists a constant B such that for any factor u of w and for 
all letters a€Y, we have ||ri|a — pa{w)\u\\ < B. 

In fact, the latter proposition implies that balance is equivalent to the bounded width property. 
Indeed, the bounded width property implies balance. The converse is obtained by applying the 
above proposition to the prefixes of the word. 







































Corollary 1. An infinite word is balanced if and only if it is of bounded width. 

The proof of Theorem [2] heavily relies on the graph representation of the word. It consists of 
several lemmas restricting the form of the word. We first prove that if a factor w[i + l..j] of w is 
such that the graph of the word between i and j lies above or below the line segment connecting 
the points {i,gw{i)) and {j,gwij)), then w[i + l..j] is weakly abelian unbordered: 

Lemma 2. Letw be an infinite binary word, andi, j be integers, i < j. If gw{k) > fc+ 

eac/i i < k < j, then the factor w[i + l..j] is weakly abelian unbordered. 

We also note that a symmetric assertion holds for the case when the graph lies below this line: 
simply invert the inequality in the statement of the lemma. 



Figure 2: Illustration of Lemma [2l weakly abelian unbordered factor IIOIO. 


Proof. For any proper prefix p and any proper suffix s of u)[i + l..j] we have pi{p) — poip) > 
aiid pi{s) — po{s) < . Hence there is no weak abelian border. □ 

However, the reciprocal of the previous lemma does not hold. A counterexample is given by the 
weakly abelian unbordered word 1110000011000: see Figure [3l 



Figure 3: The reciprocal of Lemma [2] is not true: weakly abelian unbordered factor 
1110000011000 . 


Lemma 3. Let w be an infinite binary word with finitely many weakly abelian unbordered factors. 
Then w is balanced. 


We remind that by balance we mean A'-balance for some K. 


Proof. Suppose the converse. Unbalance implies that for every K there exist i, j, 
such that 


9w (0 


9w{j) - gwji) ^ _ 9w{i)j - 9w{j)i 
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> K. 
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In other words, the point (fig^j^l)) lies at distance at least K from the line connecting the points 
i and j (distance measured vertically). Indeed, unbalance implies that there exist two factors u 
and V of the same length in which the number of occurrences of the letter 0 (and hence also the 















































letter 1) differs by more than K. Consider the factor w[i + \..j] starting with u and ending with 
V. It is not hard to see that it satisfies the inequality m for either I = i + \u\ or / = j — 

Take the shortest such factor (i.e., choose i, j satisfying the condition m with the smallest 
j — i). Then the graph does not intersect the line segment connecting the points 
and {j,gw{j)), for otherwise there exists a shorter factor satisfying ([1]). Hence due to Lemma O 
it remains to notice that the lengths of such factors grow as K grows (in fact, j — i > K). □ 

Since balance implies the existence of letter frequencies, Lemma [3] implies that a word satisfying 
the conditions of Theorem [2] has letter frequencies. We will now prove that the frequencies are 
rational. 

Lemma 4. Let w he an infinite binary word with finitely many weakly abelian unbordered factors. 
Then the letter frequencies are rational. 

Proof. Balance implies bounded width, i.e., there exist a, fi', fi" such that the graph of w lies 
between two lines: ax + fi' < gw{x) < ax + fi". Here a is related to the letter frequencies as 
follows: po = and pi = So, we need to prove that a is rational. Assume the converse. 
Take the lowest (in the sense of the smallest /3) line y = ax + fi such that there are points of 
the graph arbitrarily close to it. We build the line (and prove its existence) as follows. 

Take e > 0, divide the stripe ax + fi' <y < ax + fi” into stripes of width at most e (width can be 
measured vertically, although this is not important). At least one of the stripes contains infinitely 
many points {i,gw{i)) of the graph Gy^. Choose the lowest such stripe ax + l3[ < y < ax + fif. 
Divide it into stripes of width si < e, continue the process with —)■ 0. The value of /3 in the 
desired line is obtained as a limit fi = lim^^oo fin = hiUn-j-oo fin- To prove that a is rational, we 
consider three cases: 

Case 1: There are no points of the graph Gy, below the line y = ax + fi. In this case take 
e > 0 and consider the stripe ax + /3 < y < ax + fi + e. There are infinitely many points in this 
stripe by the definition of the line y = ax + fi. Consider any two consecutive points {ni, gy,{ni)) 
and (n 2 , gy,(n 2 )) from the stripe (consecutive in the sense that for any n, ni < n < n 2 , we have 
9 w{n) > ax + fi + e). By Lemma [21 the factor w[ni + l..n 2 ] is weakly abelian unbordered, since 
the points of the graph Gy, between ni and n 2 lie above the line y = ax + fi + e, which in turn 
lies above the line segment connecting the points (ni,y^(ni)) and (n 2 , 510 (^ 2 )). 

Now we will prove that the minimal possible length (n 2 — nfij grows as e —>■ 0. Indeed, take 
the shortest factor w[ni + l..n 2 ] with ni, n 2 consecutive points in the stripe such that the slope 
of the line segment connecting the points (ni,y^(ni)) and {n 2 , gw{n 2 )) is the closest to a, i.e., 
\a — I 11 is minimal among the factors of the length { 1 x 2 — ni) with both ends in the 

stripe. If there are several such factors, choose one of them, say the leftmost. Since a is irrational, 
it is not equal to the slope of the line connecting points (ni,y^(ni)) and {n 2 ,gw{n 2 )). So, take 
fi < I \gw{n 2 ) — gw{ni) \ — a{n 2 — n.i)|, then no two points of the graph corresponding to positions 
in the word at distance (n 2 — ni) can simultaneously be in the stripe ax + fi < y < ax + fi + s'. 
So, the minimal distance between points in the new (thinner) stripe of width s' increased, which 
corresponds to weak abelian unbordered factors of length more than 77-2 —ni. Continuing this line 
of reasoning and taking smaller and smaller e, we get arbitrary long weakly abelian unbordered 
factors. 

Case 2: There are finitely many points of the graph Gy, below the line y = ax + fi (but the 
number of such points is nonzero). In this case take the last point (no, y«)(no)) below the line: 
gw{no) < ano + fi, and gw{n) > an + fi for each n > no- Consider a prefix of length N > uq oi 
w, and the part of the graph between no and N. Choose a point n', no < n' < N, such that the 
point {n',gy,{n')) is the closest to the line among the points (n,y^(n)), no < n < N: 

\gwin') - an'- fi\ = min \gy,{n) - an - fi\. 

no<n<N 



Then due to Lemma [21 the factor w[nQ + is weak abelian unbordered. Indeed, the points 
of the graph Gu, between no and n' lie above the line y = ax + 5 u,(n') — an' (the line parallel to 
y = ax + (3 and passing through the point {n', g^in'))), which in turn lies above the line segment 
connecting the points (no, S'«;(no)) and {n', g^{n')): 


.X. , , ,x 9w{n') - gw{no) , gwino)n'- gy,{n')no 

gw{n) >an + gw{n ) — an > -;-n + 


n' — no 


n' — no 


for no < n < nh Since there are points of the graph arbitrary close to the line y = ax + /?, 
then, increasing N, we will find larger and larger values n' and hence find arbitrary long weakly 
abelian unbordered factors. 

Case 3: There are infinitely many points of the graph below the line y = ax + /3. Since 
we chose the lowest line with the property that there are infinitely many points arbitrary close 
to the line, it follows that for any e > 0 there are only finitely many points below the line 
y = ax + (3 — £ (if any). Take e such that there are points below the line y = ax + (3 — e. By the 
condition of Case 3 there are infinitely many points in the stripe ax + (3 — £ < y < ax + (3. For 
each two consecutive points in the stripe the corresponding factor is weakly abelian unbordered 
due to Lemma 121 As e —0, the lengths of such factors grow for irrational a (the proof of this 
is similar to the proof of the similar fact from Case 1). 

Therefore, we came to a contradiction in each case. So the frequency must be rational. □ 


Proof of Theorem O Lemmas [3] and 0] imply that the graph of the word lies between two lines 
of rational slope a: an + f3' < gwin) < an + (3" for each n. We can choose such lines with the 
minimal /3" and maximal /3h To prove the theorem it remains to show that the graph of the 
word has points on each of these two lines with bounded gaps. We provide a proof for the line 
y = ax + [3', for the other line the proof is symmetric. 

First we prove that there are points of the graph on the line. Clearly, for rational a there is 
a fixed minimal distance from the integer points of the lattice to the line (note that for 
irrational slope this does not hold). If there are no points of the graph on the line, since the 
slope is rational, instead oi y = ax 3- /3' we could choose a different line y = ax -h (3'" with the 
same slope passing through the point of the graph closest to the line, and with (3'" > f3', which 
contradicts the choice of /3' as the maximal possible. 

Secondly, there are infinitely many points on the line. Otherwise consider a line y = ax + f3'" 
with (3'" > 13', with infinitely many points of the graph on the line and with the smallest such 
(3'". Note that here we need again a to be rational. Then there exists no such that for all 
n > no we have gwin) > an + (3'", i.e., starting from no the graph lies above the line. Taking the 
smallest possible no satisfying the condition, we get that the point (no,5«,(no)) lies below the 
line. Hence for each n' > no corresponding to a point on the line, i.e., with gwin') = an' + (3'", 
we have that the factor w\nQ + l..n'] is weakly abelian unbordered by Lemma[21 Indeed, all the 
points between no and n" lie above the line y = ax + (3'" and hence above the line segment 
connecting the points (no, 5'w,(no)) and in', gwin')) of the graph corresponding to the beginning 
and the end of the factor. 

Finally, we need to prove that the points on the line y = ax +13' appear in bounded gaps. This 
follows from the fact that for two consecutive points ni, n 2 on the line y = ax + (3' the factor 
w[ni + l..n 2 ] is weakly abelian unbordered by Lemma|21 □ 

Remark 1. Except for the case po = Pi = 1/2 (equal frequencies), we do not know whether 
the converse of Theorem 2 is true (see Question [2|) . In the case of equal frequencies the converse 
holds, and it could be easily seen from the graph of the word. In fact, we will prove that all 
sufficiently long factors have weak abelian borders with frequencies 1/2. Frequencies equal to 
1/2 correspond to a = 0, so the graph lies between two horizontal lines y = (3' and y = j3", 




and intersects each of them with bounded gaps. Denote the maximal gap by C. Consider a 
factor w[i + l..j] of length > C: j — i > C. Then there exist n',n", i < n',n" < j, such that 
Owin') = /3', Owin'') = (3". Without loss of generality n' < n" . For each integer P, /3' < /3 < f3", 
there exists n, n' < n < n", such that Owin) = /?• In other words, in the interval [n',n"] the 
graph passes through any integer horizontal line between (3' and f3". In particular, there exist 
i', j', i <n' < i',j' < n" < j, such that gwii) = Owii'), Owij) = Owif)- So, the prefix w[i + 
and the suffix w[j' + l..j] give a weak abelian border with the frequencies 1/2. But we do not 
see how to generalize this observation to the case of nonequal frequencies, since in general the 
graph can “jump” through several rational non-horizontal lines and it does not have to pass 
through all rational lines between points on lower and upper lines. 


4 Weak abelian borders: nonbinary alphabet 

Now we continue with non-binary alphabets and prove the analogue of Theorem [2j Basically, a 
non-binary word with finitely many weakly abelian unbordered factors must also be balanced, 
weakly abelian periodic and have rational letter frequencies. To state the analogue of the 
additional condition in Theorem [2] (graph lying between two parallel rational lines and bounded 
weak abelian periodicity along those lines) we need some notation. 

First, in the non-binary case, instead of bounded width we define cylinder set. Let I denote a 
line in 1: Xi = + ait, i = 1,..., fc, and let M be a constant. A fc-dimensional cylinder 

C'(/, M) is defined as 

Cil,M) = {x € R'^\dil,x) < M} 

where dil,x) denotes the distance between the point x and the line 1. The line I is the axis of 
the cylinder. 

Now we will give an analogue of the two parallel lines such that the graph lies between those 
two lines and has points on each of those lines with bounded gaps. If the graph Gw of an 
infinite word w over a /c-letter alphabet belongs to some cylinder (7(1, M), we define a tangential 
hyperplane to the graph of Gw as a hyperplane H: hixi b^Xk = d parallel to the line I 

such that for each point x = (xi,..., Xk) G Gw one has bixi bkXk > d, and in addition 

there is a point x' = (x'^,... ,x(j) G Gw such that bix'^ b^x'^. = d. We remark that a 

tangential hyperplane to the graph Gw does not have to be a “proper” tangential hyperplane 
to the cylinder Gil, M), but it is a hyperplane parallel to it. In the remaining part of the text, 
by “tangential” we always mean tangential to the graph. Next, we define a tangential line l! to 
Gw, I' '■ Xi = x'^ + ait, i = 1,..., fe, as a line which belongs to some tangential hyperplane H to 
Gw, H: biXi bkXk = d, such that for each point x G Gw H H one has also x G t. In other 

words, a tangential line t is defined as a line parallel to the cylinder lying in some tangential 
hyperplane, such that the only points of the graph belonging to the hyperplane also belong to 
the line. We remark that this line can be inside the cylinder. 

Now we are ready to state the non-binary analogue of Theorem [2j 

Theorem 3. Let w be an infinite word over a k-letter alphabet. If there exists a constant G 
such that every factor v of w with |x| > G is weakly abelian bordered, then w is bounded weakly 
abelian periodic. Moreover, its graph Gw belongs to a k-dimensional cylinder with axis with 
rational coefficients, and each tangential line to Gw has points of Gw on it with bounded gaps. 

We emphasize that in fact Theorem [2] is a particular case of Theorem [3l We state and prove it 
separately for several reasons. First, for clarity reasons: the statement of Theorem [2] is easier 
and more intuitive. Second, key lemmas in non-binary case are simple consequences of those in 
the binary case. On the other hand, the proof of the body of the theorem is more involved, so 
there is not much repetition in the proofs. 


First we prove that Lemmas [3] and O also hold for the non-binary case: 

Lemma 5. If an infinite word has finitely many weakly abelian unbordered factors, then it is 
balanced. 

Proof. Assume that w is an infinite word with finitely many weakly abelian unbordered factors. 
For a € alph(w), consider a morphism ha', a a, b c for all 6 7 ^ a where c is a letter not 
belonging to alph(w). We call such a morphism a projection. Clearly, if two words u and v 
have the same letter frequencies, then so do ha{u) and ha{v), for each a G alph(w). Hence if 
u is weakly abelian bordered, then so is ha{u). Then ha{w) has finitely many weakly abelian 
unbordered factors. Now, since a projection of tc is a binary word (over the alphabet {a,c}), 
we can apply Lemma O hence ha{w) is also balanced: There exists a constant Ka, such that for 
each two factors u,v of ha{w) such that |rt| = |u| the inequality ||tt|a — |u|a| < Ka holds. Taking 
K = maX(jgs Ka, we get that w itself is iL-balanced by definition. □ 

Lemma 6. If an infinite word has finitely many weakly abelian unbordered factors, then its letter 
frequencies are rational. 

Proof. Clearly, the frequency of any letter a exists in an infinite word x if and only if the 
frequency of the letter a exists in its projection ha{x), and these frequencies are equal. Since 
the projection is a binary word. Lemma 0] implies that w has letter frequencies and they are 
rational. □ 

We let {b,x) denote the scalar product of vectors b and x: {b,x) = 61 X 1 b^Xk- We make 

use of the following analogue of Lemma [2j 

Lemma 7. Let w be an infinite word, and i, j be integers, i < j. If there exists a hyperplane 
H : bixi bf^Xk = d such that Guj{i), G^ij) G H and (b, Gw{l)) > d for each i < I < j (or 

{b, Gwil)) < d for each i < I < j), then the factor w[i + l..j] is weakly abelian unbordered. 

Proof. Two factors w[ii + l..ji] and w[i 2 + l..j 2 ] share the same letter frequencies if the vectors 
Gw{ji) — Gw{ii) and G«,(j 2 ) — Gyj{i 2 ) connecting their ends are collinear. Moreover, they differ 
by a positive multiple. For any proper prefix w\i + l..p] and any proper suffix w[s + l..j] of 
w[i + l..j] we have (G^ip) — Gw{i),b) > 0 and (G«)(j) — Gu,{s),b) < 0, which implies that 
w[i + l..p\ and t(;[s -|- l..j] do not have the same letter frequencies. □ 

Proof of Theorem O Like in the binary case, balance implies that G^ belongs to some k- 
dimensional cylinder G{l,M). The coefficients ai of the axis I of the cylinder are related to 
the letter frequencies and can be defined via {vi,..., v^}. Since frequencies are rational and all 
Vj have integer entries, the coefficients Ui are also rational. In fact, the graph of the word lies on 
finitely many lines with rational coefficients parallel to the cylinder axis. Clearly, in dimension 
two this is equivalent to bounded width. 

First notice that such tangential lines always exist. We can choose a cylinder with the minimal 
M. Similarly to the binary case and using the fact that the slope vector a of the cylinder is 
rational it is not hard to show that there exists a point x' G G^ with d{l,x') = M. Consider 
the line t parallel to I and passing through one of such points, f: x = x' + at. Taking the 
tangential plane H to the cylinder passing through x', we see that t is a tangential line to the 
graph Gw. We remark that this tangential plane H is both a tangential plane to the graph Gw 
and a “proper” tangential plane to the cylinder G{1, M). Although there can be other tangential 
lines to the graph Gw lying inside the cylinder, so that the corresponding tangential planes to 
Gw are not tangential to the cylinder. 


To prove the theorem it remains to show that for a tangential line f: x = x” + at the points of 
the graph lie on this line with bounded gaps. 

Like in the binary case, we hrst show that there are infinitely many points of the graph on the 
line f. Suppose that there are only finitely many of those. Take the corresponding tangential 
hyperplane H": bixi + • • • + b^Xk = d” and consider the hyperplane H' : bixi + • • • + b^x^ = d' 
parallel to H” such that it has inhnitely many points of on it (since lies on finitely many 
lines parallel to f due to rational coefficients, it also lies on finitely many hyperplanes parallel 
to H"). So, there are finitely many points of G^ such that biXi + • • • + bkXk < d', and inhnitely 
many with biXi + • • • + bkXk = d'. 

Now take the point x(^o) = G^(no) on the line with the largest no such that bix[’^°^ + • • • + 
bkX^i^°^ < d'. Then for each n with + ••• + bkX^^'^ = d' for = Gyj{n) the factor 

w[nQ + l..n] is weakly abelian unbordered. Indeed, arguing as in Lemma[71 for any proper prehx 
w[i + l..p] and any proper suffix u)[s + l..j] of w\i + l..j] we have (G^(p) — Gu,{i),b) > 0 and 
b) < 0, which implies that w\i + l..p] cannot be weakly abelian equivalent to 
r(;[s + l..j]. So, we proved that there are inhnitely many points of the graph on a tangential line. 

Finally, we need to prove that the points on the tangential line f appear in bounded gaps. It 
follows from the fact that for two consecutive points ni, n 2 on f the factor w\ni + l..n 2 ] is 
weakly abelian unbordered by Lemma [71 □ 


5 Abelian borders 

In this section we consider the connections between abelian periodicity and the condition of 
having hnitely many abelian unbordered factors. First we show that hnitely many abelian 
unbordered factors is not a sufficient condition for periodicity: 

Proposition 2. There exists an infinite aperiodic word w and constants C, D such that every 
factor V of w with |n| > C has an abelian border of length at most D. 

Proof. Any aperiodic word w € {010100110011, 0101001100110011}^^ satishes the condition with 
C* = 15 and D = 14. The proof is a fairly straightforward and not too technical case study: 

If n is a sufficiently long abelian unbordered factor, then it must begin or end with either 0100 or 
with 1101 because the other factors of length 4 have equal frequencies. So there are only these 
two cases to consider: the factor starts with 0100 (the case of ending with 1101 is symmetric) 
or with 1101 (the case of ending with 0100 is symmetric). 

Consider first the case of a factor beginning with 0100. If it ends with 0, we have an abelian 
border of length 1; if it ends with 01, we have an abelian border of length 2. So we need to 
analyze the remaining case of the form 0100 • • • 0011. 

Considering extension of the prefix 0100 to the right, one has after 01 abelian period 4 with 
equal frequencies. Considering extension of the suffix 0011 to the left, one has abelian period 
4 with equal frequencies. So, extending to the left the suffix, one will get an abelian border as 
soon as there is a block 0101. Thus, there exists an abelian border of length Ak + 2, where k is 
at most 3. 

Now consider the case of a factor beginning with 1101. It continues uniquely to the right with 
01001100, so that 110101001100 is the prefix of the factor. If it ends with 1, we have an abelian 
border of length 1, so we need to consider the cases of ending with 00 and 10. 

If it ends with 00, then it has either a suffix 0100, or 1100. The suffix 0100 extends uniquely 
to the left to 11010100, giving an abelian border of length 8. The suffix 1100 extends to either 


11001100 giving an abelian border of length 8, or to 110101001100, giving an abelian border of 
length 12. 

If it ends with 10, then it has a suffix 010 (otherwise there is an abelian border of length 3), 
which extends uniquely to 1010. This suffix in turn extends to 01010 (otherwise there is an 
abelian border of length 5), which extends to 1101010, giving an abelian border of length 7. 
This concludes the proof. □ 

We observe that all infinite word w € {010100110011,0101001100110011}'^ occurrring in the 
proof of Proposition [2] are abelian periodic with period 2. In general, we do not know whether 
an infinite word containing only finitely many abelian unbordered factors is necessarily abelian 
periodic. However, we begin by showing that abelian periodicity alone does not in general 
imply only finitely many abelian unbordered factors. In other words, at least one direction of 
Theorem 1 fails in the abelian setting. 

Proposition 3. Letx be a uniformly recurrent aperiodic word containing infinitely many distinct 
palindromes. Then x admits an infinite number of abelian unbordered factors. 

Proof. Let N be any positive integer. We claim that x admits a factor of the form allb where a 
and b are distinct letters and U a palindrome with |I7| > N. In fact, since x contains inhnitely 
many palindromes, there exists a palindromic factor u of x with |u| € {N,N + 1}. By uniform 
recurrence there exists a constant M such that every factor of x of length M contains an 
occurrence of u. Since x is aperiodic, there exists a word v with |u| = 3M and distinct letters 
c and d such that both cv and dv are factors of x. In the usual terminology, u is a left special 
factor of X. Let rj denote the center of the first occurrence of u in the prefix of v of length M. 
So rj is either an integer or a half integer depending on the parity of |tt|. Let U be the longest 
palindromic factor of v centered at rj. It follows that 2M > \U\ > |n| > N. If [/ is a not a prefix 
of V, then there exist distinct letters a and b such that the word allb is a factor of v centered 
at Tj. On the other hand, if 17 is a prefix of v, then let 6 be a letter such that Ub is a prefix 
of V. Since both cv and dv are factors of x we can find a letter a different from b such that 
allb is a factor of x. Having established our claim, the result of the proposition now follows 
immediately since any word of the form aUb with a b and U a palindrome is clearly abelian 
unbordered. □ 

As an immediate corollary we have: 

Corollary 2. There exist abelian periodic words having infinitely many abelian unbordered fac¬ 
tors. 

Proof. Let x € {0, l}'^ be any binary uniformly recurrent aperiodic word containing infinitely 
many palindromes. For instance x can be taken to be the Thue-Morse infinite word or any 
Sturmian word. Let r denote the morphism 0 i—>• 0110, 1 1001. Then t{x) is abelian 

periodic (with abelian period 2). Moreover, it is readily verified that t{x) is uniformly recurrent, 
aperiodic, and contains infinitely many distinct palindromes. By Proposition [3l it follows that 
r(x) admits inhnitely many abelian unbordered factors. □ 

As mentioned earlier, we do not know whether the other direction of Theorem [1] holds in the 
abelian context, that is whether a word having only hnitely many abelian unbordered factors 
is necessarily abelian periodic. However we are able to establish the abelian analogue of the 
following weaker version of Theorem (T) 

Theorem 4. Let x be an infinite word having only finitely many unbordered factors. Then there 
exists a constant N such that x contains at most N factors of each given length n > 1. In other 
words, x has bounded factor complexity. 


The reason this is a weaker version of Theorem [T] is because bounded complexity is equivalent 
to ultimate periodicity. For this reason the converse of Theorem 0] does not hold in general. 
For instance x = 01111 •• • has bounded complexity, yet contains infinitely many unbordered 
prefixes. 

We now give several new proofs of Theorem [1] which may be of independent interest. One 
direction is of course trivial: namely, if x is purely periodic then all sufficiently long factors of 
X are bordered. For the other direction, as in the original proof by Ehrenfeucht and Silberger, 
we break the proof into two parts; in a first part we show that if x is any infinite word with the 
property that all sufficiently long factors are bordered, then x is ultimately periodic. We give 
three proofs of this first part which to our knowledge are all new. However, each of the following 
proofs relies on one or more key features which do not extend to the abelian context. See the 
remark following the proofs. In a second part we show that x must be purely periodic. Also 
our proof of the second part is new and greatly simplifies the original proof by Ehrenfeucht and 
Silberger. 

Proof of TheoremUl Let U{x) denote the set of unbordered factors of x. Clearly if x is purely 
periodic then U{x) is finite. Conversely, assume U{x) is finite. We hrst show that x is ultimately 
periodic. 

A first quantitative proof of this was given by K. Saari in which he shows that for any aperiodic 
word X and any positive integer n, the number Cx{n) of Lyndon words of length < n occurring 
in X is greater or equal to Cf{n) where f is the Eibonacci word. See Theorem 3 and Remark 3 
in [TTj . 

A second proof, also using Lyndon words proceeds as follows: If x is aperiodic, then by Propo¬ 
sition 4.8 in [1], X contains an infinite Lyndon word x' in its shift orbit closure, i.e., there exists 
an ordering of the alphabet of x relative to which x' is lexicographically smaller than all its 
proper suffixes. Since x' begins in infinitely many unbordered prefixes, it follows that x contains 
infinitely many unbordered factors. 

A third proof uses the so-called Duval conjecture, first proved by the second author together 
with D. Nowotka in [7], which states that if u is a word of length 2n such that the prefix tt of u of 
length n is unbordered and every factor of v of length greater than n is bordered, then v = uu. 
Assume U{x) is finite and let u € U{x) be of maximal length. Then by the Duval conjecture, 
every occurrence of tt in x is an occurrence of Hence x is ultimately periodic with period u. 

A fourth proof, perhaps the simplest in that it is completely self contained, proceeds as follows: 
Assume b({x) is hnite. By replacing x if necessary by some suffix of x, we can assume that each 
u G U{x) is recurrent in x, i.e., occurs in x an infinite number of times. Let u G U{x) be of 
maximal length. Let v be any first return to u in x, i.e., vu is a factor of x which begins and ends 
in u and \vu\u = 2. Then since u is unbordered, |x| > |u| i.e., u is a prefix of v. If |x| > |u|, then 
V is bordered (by maximality of \u\). Let v' denote the shortest border of v. Then v' G U{x) 
from which it follows that |u'| < |u| and hence v' is a proper prefix of u (since u is not a suffix 
of v). But then the factor v'u of x is an unbordered factor of x of length greater than |ri|, a 
contradiction. Hence, if u is a first return to u, then |u| = |u|, i.e., v = u. It follows that x is 
ultimately periodic with period u. 

We next show that if x = X 1 X 2 X 3 • • • is ultimately periodic and all sufficiently long factors of x 
are bordered, then x is purely periodic. Let x' denote the left infinite word • • • X 3 X 2 X 1 obtained 
by reflecting x. By assumption there exists a primitive word u such that is a prefix of x' 
(where ‘^u denotes the left infinite word • • • uuu). Clearly if tt = a for some letter a, then x' = ‘^a. 
Thus we can assume that u contains at least two distinct letters. Let < be a linear order of 
alph(x) and let <’ denote the opposite order, i.e., a <' 6 if and only if 6 < a. Let M (m, 
respectively) denote the Lyndon conjugate of u relative to < (<', respectively). Without loss 
of generality we can write x' = ^Mz = ^myz for some suffix 2 ; of x' and some suffix y of M. 


We will show that z is a prefix of M'^ and hence that x' is purely periodic. Let denote the 
prefix of of length \z\ and niyz the prefix of of length \yz\. Since M is unbordered and 
all sufficiently long factors of x' are bordered, it follows that z is a product of prefixes of M 
and hence that z < Mz- Similarly, yz <' niyz or equivalently yz > niyz- Since = yM‘^, we 
deduce that > Mz and hence z = Mz as required. □ 

Remark 2. We note that none of the above proofs readily extend to the abelian context. In 
fact, we know of no suitable analogue of Lyndon words in the abelian setting. Also, the Duval 
conjecture fails miserably in the abelian setting. More precisely, there exists an inhnite word 
X with the following properties: i) x begins in an unbordered factor u, ii) all factors of x of 
length greater than |tt| are abelian bordered, iii) the prefix of x of length 2|rt| is not an abelian 
square. To see this, let y be any infinite word in {010100110011, 0101001100110011}^ beginning 
in (0101001100110011)^. Let x denote the second shift of y, i.e., x = (01)“^?/. Then x begins in 
the unbordered factor u = 01001100110011 of length 14. As in the proof of Proposition [21 every 
factor of X of length greater than 14 is abelian bordered. Yet it is easily checked that the prefix 
of length 28 of x is not an abelian square, i.e., not of the form uu' with u' abelian equivalent 
to u. Finally, the fourth proof uses the following fact which is no longer true in the abelian 
setting, namely that if u is a proper prefix of an unbordered word u, then vu is unbordered. 
For instance, we can take the prefix u = 01 of the abelian unbordered word u = 010011. Yet 
vu = 01010011 is abelian bordered. 

We end by establishing the following abelian analogue of Theorem HJ 

Theorem 5. Let x be an infinite word having only finitely many abelian unbordered factors. 
Then there exists a constant N such that x contains at most N abelian equivalence classes of 
factors of each given length n > 1. In other words, x has bounded abelian factor complexity. 

Proof. Since the condition that x has only finitely many abelian unbordered factors implies that 
x has only finitely many weakly abelian unbordered factors, it follows that all conclusions we 
derived earlier in the weak abelian setting still hold. In particular x is balanced. Thus x has 
bounded abelian complexity (see Lemma 3 in HD]). □ 

6 A non-abelian periodic word with bounded abelian square at 
each position 

In [T], the following open question was proposed: Let w be an infinite word and C be an integer 
such that each position in tc is a centre of an abelian square of length at most C. Is w abelian 
periodic? We answer this question negatively by providing an example (actually, a family of 
examples). 

Consider a family of infinite words of the following form: 

( 000101010111000111000 ( 111000 )* 111010101 )'^ 

A straightforward case study shows that words of this form have an abelian square of length at 
most 12 at each position. It is not hard to see that this family contains abelian aperiodic words. 


7 Conclusions and open questions 


We conclude with the summary of our results and propose two open problems. 


Question 1. Let w he an infinite word and C a constant such that every factor v of w with 
|u| > C is abelian bordered. Does it follow that w is abelian periodic? 

Recall that we showed that there exist examples of words satisfying these conditions which are 
not periodic (Proposition [2]), but all examples we have are abelian periodic. 

The next question asks whether the converse of Theorem [3] (and in particular Theorem [2|) is 
true: 

Question 2. Let w be an infinite bounded weakly abelian periodic word over a k-letter alphabet 
such that its graph Gw belongs to a k-dimensional cylinder with axis with rational coefficients, 
and each tangential line to Gw has points of Gw on it with bounded gaps. Does it follow that w 
has only finitely many weakly abelian unbordered factors? 

We remark that in the case of binary alphabets and letter frequencies equal to 1/2 the answer 
to this question is positive (see Remark after Theorem [5]). 

Our main results and open questions are summarized in Figure 1. 
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Figure 4: Results and open questions. WAP means “weakly abelian periodic”. Condition 1: The 
graph Gw of the word belongs to a fc-dimensional cylinder with axis with rational coefficients, 
and each tangential line to Gw has points of Gw on it with bounded gaps. 
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